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Abstract 

Given a set 71 = {Ri, R2, ■ ■ ■ , R n } of n randomly positioned axis parallel rectangles in 2D, the 
problem of computing the minimum clique cover (MCC) and maximum independent set (MIS) 
for the intersection graph G(7Z) of the members in TZ are both computationally hard [3]. For the 
MCC problem, it is proved that polynomial time constant factor approximation is impossible 
to obtain [15] . Though such a result is not proved yet for the MIS problem, no polynomial time 
constant factor approximation algorithm exists in the literature. We study the performance of 
greedy algorithms for computing these two parameters of G(TZ). Experimental results shows 
that for each of the MCC and MIS problems, the corresponding greedy algorithm produces a 
solution that is very close to its optimum solution. Scheinerman |16j showed that the size of 
MIS is tightly bounded by y/n for a random instance of the ID version of the problem, (i.e., 
for the interval graph) . Our experiment shows that the size of independent set and the clique 
cover produced by the greedy algorithm is at least and at most 3y/n, respectively. Thus 
the experimentally obtained approximation ratio of the greedy algorithm for MIS problem is at 
most I and the same for the MCC problem is at least | . Finally we will provide refined greedy 
algorithms based on a concept of simplicial rectangle. The characteristics of this algorithm may 
be of interest in getting a provably constant factor approximation algorithm for random instance 
of both the problems. We believe that the result also holds true for any finite dimension. 

Keywords: Minimum clique cover, maximum independent set, rectangle intersection graph, ap- 
proximation algorithm, empirical study 

1 Introduction 

Let G(TZ) be the intersection graph of a set TZ = {Ri, R2, ■ ■ ■ , Rn} of n randomly placed polygonal 
objects, e.g., rectangles, circles, polygons, etc., in a 2D region. In the practical applications two 
types of cliques in the geometric intersection graph are considered, namely (i) graphical clique and 
(ii) geometric clique. A graphical clique C in GiJZ) is a maximal complete subgraph of G(TZ). A 
geometric clique C consists of a maximal set of objects C C TZ such that they have a common 
point in their interior. Thus, a geometric clique is always a graphical clique, however the converse 
is not true. If TZ is a set of polygonal objects, then the problem of computing the minimum 
geometric clique cover is NP-hard [BJ. However, for a set TZ of axis-parallel rectangles, a graphical 
clique is always a geometric clique since the axis-parallel rectangles satisfy Helly property. Thus, 
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the minimum clique cover of the graph G(1Z) is same as the minimum number of points required 
to stab all the rectangles in 1Z. It is easy to show that the number of cliques present in G(1Z) 
may be 0(n 2 ) in the worst case [11]. But, if TZ is a set of c-oriented polygons (c > 5), the Helly 
property does not hold. Nilson [H] proved that the number of geometric clique in G(TZ) can be at 
most r(2, c)(f>(lZ) log 1 ^ 1 {(f>(TZ) + 1), where r(2,c) is the Gallai number of the pairwise intersecting 
c-oriented polygons and Let <p(TZ) denotes the packing number of 1Z, that is the maximum number 
of pairwise disjoint objects in 1Z. The same paper also provides a i(n,c) + 0{nc\og(4>(JZ))) time 
algorithm for computing the minimum geometric clique cover of G(7Z), where t(n,c) is the time 
required to pierce pairwise intersecting c-oriented polygons. 

In this note, we are interested in studying the performance of the greedy algorithm for computing 
the minimum clique cover (MCC) and maximum independent set (MIS) of a set G(1Z) of axis- 
parallel rectangles in 2D. The study on MCC and MIS problems on a random instance of interval 
graph have started long back by Scheinerman |16| . They showed a tight bound of y/n for both 
MCC and MIS problems on randomly generated intervals. Recently, Tran [17J formally proved 
that size of the MCC for G(1Z) can be at most 0{^/n log log n). He conjectured that size of the 
MCC is at most 0(y/n) for G(1Z). He also provided results for higher dimension of the problem. 
A lot of studies on MCC and MIS problems have been done for the intersection graph of a set of 
axis-parallel rectangles on a 2D plane. Aronov et al. [2] proposed a randomized polynomial time 
algorithm for the MCC problem using the concept of e-net that can produce O (log log n) factor 
approximation result. Later Pach et al. [15] showed that this is the best possible approximation 
for the MCC problem that can be achieved in polynomial time. For the MIS problem, the first 
approximation algorithm for arbitrary sized axis-parallel rectangles was proposed by Agarwal et 
al. [1] that produces a 0(logn)-factor approximation result in O(nlogn) time. The best known 
result for MIS problem is due to Chalermsook and Chuzhoy [3], which provides a polynomial time 
O (log log n) factor approximation algorithm for the said problem. A nice literature on MIS problem 
can be found in [3]. Although there is a lower-bound proof on approximation ratio of MCC problem, 
same for MIS problem is not known. Nielson [13] gives a construction which gives Sl(logn) bound 
for the greedy algorithm posed in [3] (see Figure [T]) . 

Halldorsson and Radhakrishnan [8j showed that for general graphs of bounded degree A, the greedy 
algorithm produces ^-^-factor approximation result for the MIS problem. It also proposes a simple 
parallel algorithm that runs in 0(log* n) time using linear number of processors. The approxima- 
tion factor can be improved to 2d + 3 using the fractional relaxation technique of Nemhauser and 
Trotter [12j . where d is the average degree of the graph. Finally, it shows that using the greedy 
strategy of removing all cliques of same size gradually improves the approximation ratio of the 
algorithm, and approximation factor is possible to achieve. Mestre |10] introduced the notion 
of fe-extendible systems and showed that for such a system the greedy algorithm produces a x- 
factor approximation result. They showed that maximum weight ^-matching forms a 2-extendible 
system, and hence greedy algorithm produces a ^ factor approximation result. Several other prob- 
lems, namely maximum profit scheduling, maximum asymmetric TSP, can be shown to satisfy the 
properties of fc-extendible system for some suitable k. 

2 Our contribution 

The main contribution of this paper is an empirical study showing that the greedy algorithms 
for each of the MCC and MIS problems produces very close result when compared with optimum 
solution for a random instance of the intersection graph of axis-parallel rectangles. Our experiment 
pushes the conjecture made by Tran [17] more towards affirmative side because the size of the clique 
cover produced by the greedy algorithm for the MCC problem on a random instance of rectangle 
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Figure 1: An example showing that approximation factor of the greedy algorithm is logn 

intersection graph is at most for reasonably big random instances and we believe that it will 
hold true for any such instances. Similarly, the greedy algorithm for the MIS problem also produces 
a very close solution to the optimum solution in the sense that the size of MIS produced by greedy 
algorithm is at least 2 x y/n. Finally we will provide two refined greedy algorithms and their 
characteristics that may be of interest to design constant factor approximation algorithms for the 
MCC and MIS problems where the participating rectangles are randomly placed and of random 
size. We strongly believe that the result also holds for the intersection graph of randomly generated 
axis-parallel rectangles in any finite dimension. We have also produced the result of Nielson's [T3] 
divide and conquer algorithm for the MCC problem. Nielson [13] has done some experimental study 
of his algorithm on the intersection graph of randomly generated rectangles and claimed that the 
approximation ratio of their algorithm is 3.42. We show that the size of the clique cover produce 
by our greedy algorithm is almost 2 times better than that of |13j . 

3 Algorithms 

We generate a set 1Z = {R%, R2, ■ ■ ■ , R n } of axis-parallel rectangles in a given rectangular region 
([a, b], [c, d]). Each rectangle is stored in the form of a pair of points indicating its bottom- left and 
top-right corners. In other words, for the rectangle Ri, we randomly choose a pair of points pi = 
(x p , y p ), qi = (x q , y q ) on the given rectangular region ([a, b], [c, d\). Without loss of generality assume 
that x p < x q . Now, if y p < y q , then Ri = [(x p ,y p ), (x g ,y q )]; otherwise Ri = [(x p ,y q ), (x q ,y p )]. Now, 
we introduce the concept of dominated rectangle. If a rectangle Ri contains some other rectangle 
Rji 3 7^ h then it is called a dominated rectangle. A dominated rectangle can be disregarded 
since it need not be considered while computing the minimum clique cover, and also it does not 
participate in the maximum independent set. After the generation of n random rectangles, let 7Z 
denote the set of all non-dominated rectangles. 

We use G{JZ) to denote the intersection graph of the members in TZ. Here the nodes correspond to 
the members in TZ. Between a pair of vertices there is an edge if the corresponding two rectangles 
share a common point in their interior. We use N(Ri) to denote the vertices adjacent to Ri in 
G{1Z) including itself. We now describe the two greedy heuristic algorithms for the MCC and 
MIS problems. We have also implemented the divide and conquer algorithm of Nielson p3] for 
the MCC problem to justify the approximation bound of our algorithms on the MCC problem 
for a random instant of rectangle intersection graph. We use this approximation bound to justify 
the approximation bound of our greedy algorithm for the MIS problem for a random instant of 
rectangle intersection graph. 
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Algorithm 1 Algorithm GCC(Te) 
1: Input: A set 72 of randomly generated n rectangles on a 2-dimensional plane. 
2: Output: A set GCC of points that stab all the rectangles in 72. 
3: Compute 72, the non-dominated set of rectangles in 72. 
4: Initialize GCC = 0. 
5: repeat 

6: Compute a maximum clique C among the rectangles in 72 using the algorithm of [9l 111) , and choose 

a point 7r in the common region of the members in C. 
7: Set 72 = 72 \C and GCC = GCC U {tt}. 
8: until 72 = 0. 

9: Return GCC as the stabbing points along with its cardinality. 



3.1 MCC problem 

Since the axis-parallel rectangles satisfy Helly property, the members of a clique in G(TZ) share a 
point in their interior. Thus, the minimum clique cover of the graph G(7Z) is same as the minimum 
number of points required to stab all the rectangles in it. 

3.1.1 Greedy algorithm 

Our greedy algorithm proceeds as follows. We perform a horizontal line sweep from top to bottom 
to compute the largest clique C of G(1Z). It is a point where maximum number of rectangles in 
it overlap. All these rectangles can be stabbed by a single point. We delete all the rectangles in 
C from 7Z and repeat the same process. The iteration continues until all the rectangles in it are 
deleted. The pseudo code of the algorithm is given below. 

Lemma 1. The worst case time complexity of the algorithm GCC is 0(n 2 log n). 

Proof. Follows from the fact that the largest clique of a rectangle intersection graph can be com- 
puted in 0(n log n) time [Sdl], and the number of iterations can be 0(n) in the worst case. □ 

3.1.2 An improved greedy algorithm 

We now introduce the concept of simplicial rectangle to present an improvement of the greedy 
algorithm GCC I for the MCC problem. A node in the graph G(1Z) is said to be simplicial if all 
the nodes adjacent to it form a clique. The corresponding rectangle will be referred to as simplicial 
rectangle. The concept of simplicial rectangle is very much similar to a simplicial vertex in a graph 
[7]. It needs to be noted that all the rectangles adjacent to a simplicial rectangle can be stabbed 
by a point along with the rectangle R. Algorithm [2] states the detailed procedure. 
Lemma 2. The worst case time complexity of the algorithm GCC A is 0(n 3 ). 

Proof. Each iteration of the repeat loop of the algorithm consists of two steps: (i) finding a simplicial 
rectangle, and (ii) finding the largest clique. 

Let N(R) denote the neighbors of R including itself in the graph G(1Z). In Step (i), the construction 
of the incidence matrix / needs 0(n 2 ) time. While searching for a simplicial rectangle, in each failure 
step it marks all the members of N(R) since there exists no other rectangle R' G N(R) which is 
simplicial. The proof is as follows. If N(R) = N(R') then surely R' is not simplicial. Otherwise 
there exists some rectangle R" such that R" £ N(R') but R" N(R); since R G N(R'), R' is 
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Algorithm 2 Algorithm GCCJ(Tl) 



1: Input: A set 1Z of n randomly generated rectangles on a 2D plane. 

2: Output: A set GCGJ. of points that stab all the rectangles in 1Z. 

3: Compute the non-dominated set of rectangles ft. 

4: Initialize 6 = 0; $ = 0. 

5: repeat 

6: R = FindJ3implicial(7£). 

7: if R ^ then 

8: Let N(R) be the rectangles adjacent to R including itself. 

9: Choose a point it in the region common to all the members in N(R). 

10: Set ft = ft\N(R) and e = 6U{7r}. 

11: else 

12: Compute a maximum clique C among the rectangles in ft using the algorithm of [9]lll|. and choose 

a point 7T in the common region of the members in C. 

13: Set ft = ft \ C and $ = $ U {tt}. 

14: end if 

15: until ft = 

16: return GCC Jl = O U $ as the stabbing points along with its cardinality. 



1: Procedure Find_Simplicial(A) 

2: Assumption: All the vertices in A are unmarked. 

3: Compute an incidence matrix / of the graph G(A); 

4: Sort the vertices of G(A) in increasing order of their degrees; 

5: while all the vertices in A are not marked do 

6: choose the vertex v having minimum degree among the unmarked vertices. 

Let N(v) be the set of vertices adjacent to v. N(v) includes the vertex v. 
7: if N(v) forms a clique then 

8: return R as a simplicial rectangle, and a point tt inside the common intersection region; 
9: else 

10: Mark the members of N(v) since they can not be simplicial. 
11: if a, b e N(v) and I[a, b] = then 

12: mark all the vertices u such that 7 [a, u] — 1 and I[b, u] = 1 

13: end if 
14: end if 

15: end while 

16: return 0. 



not simplicial. Moreover, if p, p' £ N(R) and I(p, p') = 0, then we have deleted all the rectangles 
having both p and p' as neighbors. Thus, each entry of the matrix / is accessed 0(1) time. 

Step (ii) needs 0(n log re) time in the worst case [9J [TT]. Since the number of iterations is O(re) in 
the worst case, the result follows. □ 



3.2 MIS problem 
3.2.1 Greedy algorithm 

Our MIS heuristic also depends on the concept of simplicial rectangle. If a simplicial rectangle R 
is found in G(1Z) (i.e., N(R) forms a clique), we can only choose R in the independent set among 
the set of rectangles N(R). Our algorithm is an iterative one. At each iteration, it searches for 
a simplicial rectangle. If such a rectangle R is found, it is included in MIS; otherwise, we delete 
a rectangle having maximum number of neighbors. The logic behind choosing such a rectangle is 
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Figure 2: The maximum degree rectangle removal may lead to non-optimal result 

that its absence may delete a neighbor of maximum number of rectangles. Note that, this may also 
be a simplicial rectangle if some of its adjacent rectangle is removed. In Figure [2] such a situation is 
demonstrated. Here none of the rectangles present in the region is simplicial due to the position of 
the other rectangles. Rectangle A has maximum number of neighbors. But removal of rectangle B 
makes it simplicial. However, the chance of such a rectangle to be simplicial is small. The pseudo 
code of the algorithm is given in Algorithm [3j 

Algorithm 3 Algorithm MIS (ft) 

1: Input: A set 7Z of n randomly generated rectangles on a 2D plane. 

2: Output: A set MIS of mutually non-overlapping rectangles. 

3: Compute 1Z, the non-dominated set of rectangles in 1Z. 

4: Initialize MIS = 0. 

5: repeat 

6: R = Find_Simplicial(ft). 
7: if R ^ then 

8: Let N(R) be the set of rectangles adjacent to R including itself. 
9: Set U = H \ N(R) and MIS = MIS U {R}. 
10: else 

11: Let R' be the rectangle having maximum degree. 

12: Set a = a \ {r'} 

13: end if 
14: until K = 

15: Return MIS along with its cardinality. 



Lemma 3. The worst case time complexity of the algorithm MIS is 0(n 3 ). 

Proof. Each iteration of the repeat loop of the algorithm consists of two steps: (i) finding a simplicial 
rectangle, and (ii) finding the rectangle having maximum degree. The result follows from the time 
complexity of Step (i) (see Lemma [2]). □ 



3.2.2 A variation of the Greedy algorithm 



In our greedy algorithm (subsection 3.2.1), at each iteration we searched for a simplicial rectangle. 
If such a rectangle is not found, then we removed the rectangle having maximum degree, and 
repeated the process. 

The algorithm proposed in this section is very similar to GCC-I algorithm. Here instead of re- 
moving the rectangle having maximum degree, we identified the largest clique and then removed 
all the rectangles participating in that clique. Finally, we report the set simplicial rectangles as the 
independent set. The time complexity of this algorithm remains same as that of GCC-I. 
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4 Experimental Studies 



We have performed a detailed experiment with different n (number of rectangles). For each n, 
we generated 20 different instances of n random rectangles as described in Section [3j For each 
instance, we run our proposed heuristics for both MCC and MIS problem, and also the divide and 
conquer heuristics of [13] for the MCC problem. In Table [T] and Figure [3j we refer this algorithm as 
DCC. Figure [3] shows the comparison of performance of our proposed two greedy heuristics GCC 
and GCC -I and the divide and conquer algorithm DCC of [13j for the MCC problem on rectangle 
intersection graph. It is observed that our both the algorithms produce result better that of |13j . 
We have also plotted 3y/n for different values of n in the same graph to demonstrate the solution 
produced by the improved greedy heuristic GCC J is always less than 3y/n. 
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Figure 3: Comparison of the quality of solution produced by our proposed heuristics GCC, GCC -I 
and the DCC algorithm of [13] for the clique cover of rectangle intersection graph 

In Figure [4] we demonstrate the performance of our proposed two greedy heuristics MIS and 
MIS J for the maximum independent set problem on rectangle intersection graph. We have also 
plotted 2y/n for different values of n in the same graph to demonstrate the solution produced by 
the greedy heuristic MIS is always less than 2y/n. But, the running time of our MIS J heuristic 
is much better than that of MIS. 

The final conclusion of our experimental study is summarized in Observation [TJ and is demonstrated 
in Figure [5] The justifications of getting such results are also explained. 

Observation 1. The solution produced by our greedy heuristics for the minimum clique cover 
(MCC) problem on an intersection graph of a set of randomly generated axis-parallel rectangles is 
at most 2 x OPTmcc > where OPTmcc i> s the size of the optimum solution of the same problem. 

Justification 1: Let OPTmis De the srze of the optimum solution of the maximum independent 
set (MIS) problem. We have gpjfo" 5 < 1. Figure lol shows that l ?Fr?J l < 1.5 < 2 for all 
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Figure 4: Comparison of the results produced by our proposed heuristics MIS and MIS J for the 
maximum independent set of a rectangle intersection graph 



Table 1: Experimental result 



n 


min. 


clique cover 


max. 


indep. set 


GCCJ /MIS 


DCCpLj 


GCC 


GCCJ 


MIS 


MISJ 


500 


87 


53 


48 


46 


41 


1.0434 


1000 


137 


77 


71 


64 


55 


1.1094 


5000 


367 


195 


180 


155 


124 


1.1613 


10000 


560 


281 


262 


219 


172 


1.1963 


15000 


709 


353 


327 


271 


210 


1.2066 


20000 


851 


414 


382 


313 


240 


1.2204 


25000 


971 


467 


432 


354 


270 


1.2203 


30000 


1080 


515 


478 


388 


294 


1.2320 


35000 


1183 


557 


514 


415 


311 


1.2386 


40000 


1282 


604 


559 


452 


341 


1.2367 


45000 


1369 


642 


596 


475 


356 


1.2547 


50000 


1456 


677 


628 


504 


375 


1.2460 
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Figure 5: Justification of Observation [T] 

the chosen values of n, where \GCC-I\ and \MIS\ are the size of the solution generated by 
our greedy heuristics GCC J and MIS for the MCC and MIS problems respectively. Again 
\OPTmcc\ < \CC\ for any arbitrary clique cover CC and \OPTmis\ — \IS\ for any arbitrary 
independent set IS of the given graph. Thus we have 

OPT MIS < OPT M cc < \CC\ < 2 x \IS\ < 2 x OPT MIS . 

Justification 2: In our experiment it is observed that J^^'j| < 1 for all the values of n we have 
chosen. During the execution of GCC -I we observed two parameters $ and VP, where <3? indicates 
the number of simplicial rectangles observed, and M/ indicates that the number of times we need 
to eliminate the largest clique without getting a simplicial rectangle. It is sure that the simplicial 
rectangles need to be stabbed by a point; so $ many points are essential. $ is the set of extra 
points used to stab the rectangles that are not stabbed by any point in and VP is less than <£. 
So, this gives an indication towards 2 factor approximation algorithm for the MIS problem on the 
randomly generated rectangle intersection graph. The indication could be well justified if after the 
elimination of each clique corresponding to a member in VP, we could get a simplicial rectangle. 
But the execution trace (which is not included in this note) does not demonstrate this fact. 

Justification 3: It is noticed that since the solution produced by both GCC and GCC_I algorithms 
for the clique cover problem is very close to lyfn. It is proved in [13J that the size of the optimum 
solution of the MCC problem is upper bounded by \fn for a set of n randomly positioned rectangles. 
Thus, the empirical evidences show that our algorithm produces 2 approximation result for large 
values of n. 
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5 Conclusion 



In this note, we experimentally analyze the performance of greedy algorithm for the minimum clique 
cover and maximum independent sets problems for rectangle intersection graphs. Exerimental 
result shows that it produces 1.5 factor approximation on the randomly generated instances of 
the corresponding problems. The intuitive justifications of such behavior may lead to a formal 
algorithm of getting a constant factor approximation results of the corresponding problems for 
random instances of rectangle intersection graph. 
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